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CHAPTER  I:  INTRODUCTION 


Statement  of  the  Problem 

The  task  of  data  analysis  includes  much  more  than  discovering  statis¬ 
tical  significance  among  relationships.  There  is  a  need  for  methods  wliich 
analyze  a  set  of  variables  and  interrelationships  as  a  system  —  to  study 
the  structure,  configurations,  and  patterns  within  the  complex  interrelated 
whole.  The  computer  program  CLUSTR  described  in  this  paper  will  be  part 
of  a  system  of  programs  designed  to  study  a  set  of  variables  as  a  system* 
Explaining  the  purpose  of  the  programs,  called  the  DIGRAF  system,  will  help 
to  clarify  the  purpose  of  CLUSTR. 

The  social  scientist  who  studies  behavior  in  the  field  must  study  it 
as  part  of  a  usually  somewhat  stable  system  which  includes  behaviors  and 
parts  of  the  environment.  Behavior  is  maintained  by  a  complex  set  of  be¬ 
haviors  interacting  with  parts  of  the  environment.  It  makes  sense,  then, 
to  study  behavior  as  part  of  a  system  and  to  analyze  the  interrelationships 
within  the  system  or  part  of  the  system. 

When  a  system  does  not  remain  somewhat  stable  over  time  it  ceases  to 
exist  and  is  replaced  with  another  system  of  behaviors  and  environmental 
characteristics.  Social  change,  which  is  by  definition  a  change  in  be¬ 
havior,  reflects  some  sort  of  instability  in  the  system  or  parts  of  the 
system  since  behavior  changes  when  behaviors  and  elements  of  the  environ¬ 
ment  assume  new  relationships.  The  way  to  locate  the  sources  of  instability 
in  the  system,  and  thus  to  understand  change  if  this  analysis  is  correct, 
is  to  analyze  the  old  and  new  relationships  as  systems  (Shelly,  1968). 
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The  author  will  not  be  concerned  in  this  paper  with  defining  a  sys¬ 
tem  or  discovering  what  io  important  in  a  system.  The  DIGRAF  system  is  to 
be  used  to  analyze  the  set  of  data  after  a  system  has  been  defined  and  ap¬ 
propriate  measures  taken. 

The  program  CLUSTR  is  a  clustering  technique  based,  as  is  the  rest 
of  DIGRAF,  on  graph  theory.  Clustering  is  one  of  the  first  steps  in  the 
description  of  a  system  because  it  defines  domains  of  mutual  influence, 
or  groups  of  variables  which  vary  together.  Graph  theory  provides  a  way 
of  taking  into  account  direction  of  influence,  which  is  not  considered  in 
other  methods  of  clustering.  A  brief  review  of  clustering  techniques  and 
graph  theory  will  serve  to  introduce  CLUSTR. 

Clustering  techniques 

Although  some  methods  of  clustering  existed  before  computers,  they 

were  limited  to  small  numbers  of  variables  because  of  the  large  amount  of 

* 

computation  required.  Since  I960  and  the  advent  of  relatively  inexpensive 
computers,  a  whole  new  body  of  computer-related  clustering  techniques  have 
been  developed  which  attempt  to  group  sets  of  data  patterns  into  subsets 
which  are  as  much  "alike"  as  possible.  E&ta  can  be  clustered  according  to 
variables  or  according  to  observations  (classes,  individuals,  etc.)  and 
most  of  the  techniques  can  be  used  with  varying  degrees  of  success  to  clus¬ 
ter  either.  A  cluster  is  a  set  of  data  patterns  (variables  or  observations) 
which  are  closer  together  according  to  some  criteria  than  they  are  to  come 
other  set  of  patterns.  In  the  case  of  variables  (the  clustering  of  which 
is  one  of  the  first  stops  in  the  analysis  of  a  system),  a  cluster  is  a  set 
of  variables  in  which  a  change  in  one  member  affects  or  has  an  appreciable 
probability  of  affecting  some  of  the  other  members  of  the  set  and  has  a 
very  small  probability  of  affecting  the  members  of  another  set  of  variables. 


A  cluster  of  variables,  in  other  v.’orcls,  defines  a  domin  of  influence 
within  the  system  or  subsystem.  Ideally  s  cluster  will  bo  unique  if  the 
data  is  indeed  clustered,  but  no  stable  clusters  will  be  found  if  tho  data 
is  uniformly  placed  in  the  data  space.  Clustering  Is  usually  distinguished 
from  factor  and  principle  component  analysis  in  that  clustering  deals  with 
local  regions  of  the  multidimensional  data  space  rather  than  with  projec¬ 
tions  of  the  data  onto  a  line  or  plane.  Clustering  techniques  generally 
are  quite  complex  and  require  a  great  deal  of  computation  vhich  is  possible 
only  with  tho  use  of  computers,  but  once  programmed  they  can  be  effectively 
utilized  by  persons  having  relatively  little  mathematical  and  statistical 
training.  Although  some  clustering  techniques  are  not  adequate  to  deter¬ 
mine  significance  of  the  data  in  the  statistical  sense,  they  nevertheless 
provide  descriptions  of  the  data  which  are  highly  suggestive  of  new  experi¬ 
ments  and  new  interpretations  vhich  can  lead  to  theory  building.  There  is 
a  great  variety  of  clustering  techniques,  many  of  which  are  still  in  tho 
process  of  development,  and  very  little  work  has  been  done  on  comparing 
end  evaluating  the  techniques  and  on  developing  methods  for  interpreting 
and  examining  the  resulting  clusters;  hov/evor,  the  area  is  developing 
rapidly.  A  few  types  of  techniques  will  be  briefly  described. 

One  group  of  clustering  techniques,  for  instance  those  of  Ball  and 
Hall  (1964),  Tyron  and  Bailey  (1966),  and  Bouigert  (1968),  is  character¬ 
ized  by  sorting  variables  according  to  the  minimum  distance  or  maximum  cor¬ 
relation  betv/een  the  variables  and  an  arbitrarily  chosen  set  of  "cluster 
points".  Tho  assignment  of  variables  to  clusters  as  well  as  the  position 
of  the  cluster  points  are  improved  by  iteration  until  tho  centers  of  tho 
clusters  adequately  describe  the  data  according  to  some  criteria. 

In  another  group  of  programs  (Sokol  and  Sneath,  1963,  for  example), 
tho  closest  single  pair  of  patterns  is  selected  as  a  nucleus  for  a  cluster 


and  other  variables  (or  observations)  are  assigned  to  that  cluster  on  the 
basis  of  their  closeness  to  the  pair  or  the  mean  of  the  pair.  The  process 
io  repeated  for  other  clusters.  This  type  of  clustering  is  especially  use¬ 
ful  for  taxonomic  problems  (clustering  individuals). 

Other  techniques  are  based  on  an  arbitrary  partitioning  of  the  data 
which  is  then  improved  by  switching  variables  (or  observations)  until  the 
best  partition  is  formed.  Another  way  to  cluster  io  to  decompose  the  dis¬ 
tribution  of  the  data  into  separato  normal  distributions. 

None  of  the  techniques  of  clustering  known  to  the  author,  including 
a  few  based  on  graph  theory,  are  directional;  that  is,  none  of  them  toko 
into  account  the  direction  of  influence  between  variables  —  a  considera¬ 
tion  v.'hich  is  of  vital  importance  in  the  analysis  of  a  system.  Directional 
influences  do  exist  in  a  system  and  they  must  be  taken  into  account.  CLUSTtt, 
a  technique  based  on  graph  theory,  is  one  possible  method  which  can  tales 
direction  of  influence  into  account. 

Graph  theory 

A  branch  of  mathematics  known  as  the  theory  of  graphs  was  brought  to 
the  attention  of  social  scientists  in  1953  by  tv/o  mathematicians  who  felt 
that  a  knowledge  of  the  mathematics  of  abstract  structures  would  be  of 
value  to  investigators  interested  in  various  kinds  of  empirical  structures 
(Harary  and  Norman,  1953)*  It  was  created  by  Euler  in  1736  but  remained 
on  isolated  contribution  until  the  middle  l800's  when  interest  was  re¬ 
vived  by  tho  four  color  map  conjecture  (that  only  four  colors  were  neces¬ 
sary  to  color  a  map  so  that  no  tv/o  adjacent  nations  were  tho  some  color). 
Until  recently  tho  major  reference  for  graph  theory  was  a  German  book  (Ko- 
nig,  1936).  Development  has  been  much  more  extensive  in  areas  outside  the 
U.  S.,  especially  in  Eastern  Europe.  The  first  major  reference  in  English 


appeared  in  1S62  (Ore)  and  the  first  specifically  oriented  to  social 
science  in  19&5  (Harary,  Korean,  and  Cartvrright) .  Since  its  introduc¬ 
tion  into  the  social  sciences,  graph  theory  has  proven  to  bo  a  useful 
model  for  investigators  attempting  to  handle  such  patterns  of  relation¬ 
ships  as  those  involved  in  communication  networks,  group  structure,  bal¬ 
ance  theory,  and  sociometric  choice.  Graph  theory  can  also  be  applied  to 
the  description  of  a  system. 

Graph  theory  is  concerned  vrith  the  abstract  notion  of  structure  as 
a  system  of  points  and  directed  lines.  In  the  description  of  an  empiri¬ 
cal  system  of  behaviors  and  environment  v/e  are  also  interested  in  struc¬ 
ture;  that  is,  v.’e  are  interested  in  the  fora  of  the  patterns  of  relation¬ 
ships  v;hich  exist  in  the  system.  The  direction  of  lines  is  important  in 
graph  theory,  as  is  the  direction  of  influence  in  an  empirical  system.  A 
well  defined  concept  in  graph  theory  is  that  of  a  type  of  structure  called 
a  component.  The  components,  \Jhen  interpreted  within  a  system,  represent 
a  natural  grouping  of  variables  based  on  the'  patterns  of  influence  in  the 
system.  Theso  components,  v:hich  fulfill  the  basic  requirement  of  cluster¬ 
ing  that  members  of  a  cluster  covary  ;ri.th  each  other  and  not  Y/ith  membere 
of  another  cluster  and  Y/hich  take  into  account  direction  of  influence,  form 
the  clusters  of  the  program  CLUSTR. 
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CHAPTER  II:  SOME  ELEMENTS  OF  GRAPH  THEORY 


Definitions 

In  order  to  describe  CLUSTR  it  is  necessary  to  introduce  some  basic 
definitions  from  the  mathematical  theory  of  graphs.  A  more  complete  treat¬ 
ment  of  this  theory  may  be  found  in  Structural  Models :  An  Introduction  t_o 
the  Theory  of  Grauhs  (Harary,  Norman,  Cartwright,  l$o5) ,  from  which  the 
terminology  of  this  paper  is  taken. 

A  graph  is  a  finite  collection  of  points,  or  vertices,  together  with 
a  proscribed  set  of  linos  joining  certain  pairs  of  distinct  points.  If  the 
lines  of  the  graph  have  direction  —  that  is,  if  a  end  b  are  distinct  points 
of  a  graph  and  the  line  ab  from  a  to  b  is  not  the  same  as  the  lino  ba  from 
b  to  a  —  then  the  graph  is  called  a  directed  graph  or  digraph  for  short.  A 
directed  path  joining  tho  points  a  and  e  is  a  collection  of  lines  of  the 
form  ab,  bc,...de  which  all  go  in  the  same  direction.  If  tho  collection  of 
lines  join  the  same  points  but  are  not  all  in  the  same  direction,  such  as  ab, 
cb,  . ..,  de,  the  collection  is  called  a  serin." th  from  a  to  c.  There  may  bo 
more  than  one  path  or  semipath  from  one  point  to  another.  A  point  c  is  said 
to  be  reachable  from  another  point  a  if  there  is  at  least  one  path  from  a  to 
e.  In  Figure  1  the  point  w  is  reachable  from  the  point  u  along  the  path  xv, 
w,  wu.  The  point  x,  however,  i3  not  reachable  from  the  point  u  because  they 
are  connected  only  by  a  seaipath  uv,  v/v. 


A  digraph  (or  subgraph)  io  said  to  bo  stronrly  connected  if  every  two 
points  are  mutually  reachablo;  that  is,  if  any  point  can  be  reached  from 
any  other  point  by  following  tho  arrows.  A  maximal  strongly  connected  sub¬ 
graph  is  called  a  strong  component  of  tho  digraph.  The  points  u,  v,  and  w 
of  Figure  1  together  with  the  linos  uv,  uw,  and  vu  make  up  a  strong  compon¬ 
ent  (the  only  strong  component  of  that  digraph).  Likoidco,  a  digraph  is 
said  to  be  vreakly  connected  if  every  two  points  are  joined  at  least  by  a 
semipath;  that  is,  every  point  can  be  reached  if  the  direction  of  the  lines 
is  ignored.  A  maximal  veal&y  connected  subgraph  is  called  a  weak  component 
of  tho  digraph.  The  digraph  of  Figure  1  constitutes  a  weak  component.  No¬ 
tice  that  a  strong  component  is  a  subset  of  a  weak  component.  A  point  or 
lino  can  belong  to  only  one  strong  component  and  to  only  one  weak  component. 
The  components  of  a  digraph,  aro  then  disjoint  sets  and  no  two  components 
of  the  same  type  will  overlap.  For  another  example,  the  digraph  of  Figure 
2  consists  of  two  strong  components;  one  consisting  of  tho  points  1  and  2 
and  the  other  with  the  points  5,  6,  end  7»  The  points  1,  2,  3,  5»  6,  and 
7  belong  to  the  weak  component.  The  discrete  point  4  io  trivially  consid¬ 
ered  a  strong  component  (end  thus  a  weak  component)  because  each  point  is 
said  to  be  reachable  from  itself. 

Empirical  moaning  of  a  cluster 

If  we  think  of  the  points  of  a  digraph  as  corresponding  to  a  set  of 
variables  and  the  lines  between  points  as  corresponding  to  a  measured  in¬ 
fluence  greater  than  a  given  magnitude  of  one  variable  on  another  (the  sign 
of  the  influence  is  not  considered),  then  tho  properties  of  a  digraph  re¬ 
flect  the  structural  properties  of  the  set  of  interrelationships  among  tho 
variables.  A  line  from  point  a  to  point  b,  in  a  digraph  representing  a  sot 
of  variables  and  relationships,  implies  that  variable  a  influences  variable 
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b.  If  point  e  is  reachable  from  point  a  then  variable  a  directly  or  indir¬ 
ectly  influences  variable  e.  If  several  points  are  strongly  connected,  the 
correcp  iding  variables  mutually  influence  each  other. 

The  strong  and  weak  components  of  a  graph  representing  a  set  of  vari¬ 
ables  end  their  interrelationships  indicate  the  existence  of  clusters  which 
have  properties  that  can  be  derived  by  analogy  from  the  components  of  a  di¬ 
graph.  The  variables  belonging  to  a  strong  component  vary  together  either 
in  the  Game  direction  or  in  opposite  directions.  Each  effects  overy  other 
(directly  or  indirectly)  and  each  can  be  expected  to  have  come  predictive 
valuo  for  every  other.  The  variables  belonging  to  the  same  weak  component 
are  all  members  of  interconnected  chains  of  influence,  but  any  one  variable 
cannot  necessarily  effect  every  other  variable  in  the  weak  component.  Any 
two  weak  components  at  any  given  cutoff  level  of  influence  are  entirely  in¬ 
dependent  of  each  other.  The  variables  in  a  strong  component  may  influence 
the  variables  in  another  strong  component,  byt  the  influence  cannot  be  re¬ 
ciprocal.  If  strong  components  are  linked  in  this  way  they  belong  to  the 
same  weak  component.  Figure  3  shows  strong  components  A  and  D  along  with 
the  point  c  which  ore  members  of  the  came  weak  component  which  is  indepen¬ 
dent  of  the  weak  component  containing  the  strong  components  D,  E,  and  F. 


RLguro  3 

If  a  measured  amount  of  influence  above  a  certain  level  (or  bolow  in 
the  case  of  the  negative  correlation)  exists  between  twe  variables  but  the 
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direction  of  the  influence  ic  uncertain,  then  it  is  assumed  the  influence 
occurs  in  both  directions.  If  the  variables  represented  by  a  digraph  all 
have  about  the  same  influence  on  each  other,  then  either  the  variables  will 
all  belong  to  the  same  strong  component  or  else  there  will  be  no  components, 
depending  on  the  cutoff  level  of  influence  chosen.  Either  of  those  outcomes 
is  intuitively  satisfying  since  no  clusters  really  exist  in  the  data. 

Matrix  representation  of  a  dlgranh 

Computation  of  the  strong  and  weak  components  by  computers  is  made  pos¬ 
sible  by  the  representation  of  the  digraph  in  matrix  form.  The  matrix  cor¬ 
responding  to  a  digraph  (called  an  adjacency  matrix)  is  defined  as  a  matrix 
whose  (i,j)th  entry  is  1  if  there  is  a  line  from  point  i  to  point  j,  and  0 
otherwise.  The  diagonal  elements  aro  zero.  The  adjacency  matrix  correspon¬ 
ding  to  Figure  2  is 

1234567 

lfoioooocf 
2  1010000 
3  0000000 
A  a  4  0000000 

5  0110010 

6  0000001 
7  0000100 

Notice  that  the  number  of  entries  of  1  in  the  adjacency  matrix  equals  the 
number  of  lines  in  the  digraph  and  A  is  syrometric  if  and  only  if  for  every 
line  from  k  to  f  there  is  a  line  from  j  to  i, 

A  number  of  matrices  can  be  derived  from  an  adjacency  matrix;  one  of 
these  is  a  reachability  matrix.  The  (i,j)th  entry  of  a  reachability  matrix 
is  1  if  j  is  reachable  from  i,  and  0  otherwise.  The  diagonal  elements  are 
1  since  a  point  is  said  to  be  reachable  from  itself.  The  reachability  ma¬ 
trix  of  Figure  2  is 
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1234567 

1  r  1 11000  0" 

2  1110000 

3  0010000 

R  =  4  000  1  000 

5  1110111 

6  1110111 

7 1110111 

The  reachability  matrix  R  can  be  directly  computed  frora  A  by  taking 
the  Boolean  sum  of  successive  powers  of  A  until  talcing  another  sum  gives 
the  same  matrix*  The  computation  of  the  example  reachability  matrix  as 
well  as  an  explanation  of  tho  matrix  and  Boolean  algebra  involved  may  be 
found  in  Appendix  A, 

Transposing  a  matrix  (interchanging  the  rows  and  columns  so  that  the 
(i,;j)th  entry  in  the  original  matrix  becomes  the  (j,i)th  entry  in  the  trans¬ 
posed  matrix)  has  the  same  effect  as  changing  the  direction  of  tho  lines  in 
the  original  digraph,  or  changing  the  direction  of  the  paths  in  a  reachabil¬ 
ity  matrix.  The  transpose  R'  of  our  reachability  matrix  R  is 

12  3  4  5  6  7 

ljllOOlll 

2  1100111 

3  1110111 

R'  =  4  0  0  0  1  0  0  0 

5  0000111 

6  ooooiii 

7  0000111 


Taking  tho  cross  product  of  two  matrices  is  defined  as  multiplying  the 
corresponding  (i,j)th  elements  of  each  matrix  to  form  the  (.i,j)th  element 
of  the  product.  Since  v/e  are  dealing  only  with  entries  of  0  and  1,  an  entry 
of  1  in  tho  product  indicates  that  the  corresponding  elements  in  the  multi¬ 
plicand  were  also  1.  The  cross  products  of  R  and  R'  has  entries  of  1  if  and 
only  if  the  corresponding  points  are  mutually  reachable.  The  entries  of  1 
in  a  given  row  (or  column)  are  all  members  of  the  same  strong  component.  The 


strong  components  are  circled 


-11- 


1  2  3  4  5  6  7 


RxR' 


Points  1  and  2  form  one  nontrivial  strong  component  and  variables  5»  6,  and 


7  form  another. 


To  compute  the  weak  components  of  the  example  digraph,  recall  that  the 
direction  of  the  lines  is  not  important  in  a  weak  component.  The  first  thing 
to  do,  then,  is  to  symmetricize  the  digraph}  that  is,  whenever  there  is  a 


line  from  i  to  j,  put  in  also  a  line  from  j  to  i.  Symraetricizing  is  easily 


accomplished  by  adding  A  and  A'. 

1234567  1234567 


A+A'  = 


0 

1 

0 

0 

0 

0 

IP 


r 


1 
2 
3 
+  4 

5 

6 

7 


1  2  3  4  5  6  7 


1 

2 

3 

:,4 

5 

6 

7 


The  reachability  matrix  of  the  symmetricized  digraph  gives  those  points 
which  are  reachable  if  the  direction  of  the  lines  is  disregarded  —  those 


points  v/hich  make  up  the  weak  components  of  the  digraph.  The  entries  of  1 
in  a  given  row  (or  column)  give  the  points  v/hich  are  members  of  the  same 


strong  component.  The  example  digraph  has  one  weak  component  consisting  of 
every  point  except  point  4.  Rearranging  the  order  of  the  points  makes  the 


component  easier  to  see. 


1  2  3  4  5  6  7 


1  2  3  4  5  6  7 


1 

2 
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R(A+A«)=  4 
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1110  11 
0  0  0  1  0  0 
1110  11 
1110  11 
1110  11 


1 

1 

1 

0 

1 

1 

1 


0  0  0 
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CHAPTER  III:  THE  PROGRAM  CLUSTR 


The  program  CLUSTR  computes  the  strong  and  weak  components  of  a  digraph 
exactly  as  they  %/ere  computed  in  Chapter  II.  The  relevant  programs,  modi¬ 
fied  to  bo  used  separately,  are  included  in  Appendix  B. 

Input  to  CLUSTR  is  an  adjacency  matrix  based  on  a  similarity  matrix. 

The  level  of  cutoff  above  which  a  measure  of  similarity  is  considered  sig¬ 
nificant  is  the  only  parameter  of  the  clustering  technique.  Usually  several 
adjacency  matrices  are  computed  with  different  values  of  the  parameter  and 
the  resulting  clusters  compared.  In  actual  practice,  the  similarity  matrix 
on  which  an  adjacency  matrix  is  based  is  nondircctional  and  the  relation¬ 
ships  will  be  represented  in  the  adjacency  matrix  as  if  the  variables  mutu¬ 
ally  influence  each  other  unless  some  additional  information  can  eliminate 
x,he  possibility  of  influence  in  one  direction.  Some  variables  such  as  sex 
and  ago  are  nearly  alv/ays  independent,  for  others  there  is  sound  evidence 
from  past  studies  to  indicate  the  direction  of  the  relationship,  and  measure¬ 
ments  of  some  occur  before  the  measurements  of  other  variables  in  time.  A 
program  for  computing  adjacency  matrices  which  also  takes  direction  of  in¬ 
fluence  into  account  is  included  in  the  DIGRAF  system. 

A  mainline  calling  program  reads  in  the  parameter  cards  and  adjacency 
matrix  before  calling  CLUSTR.  The  first  parameter  card  contains  the  number 
of  variables  in  columns  1  to  5»  right  justified;  the  second  card  contains 
any  titling  information  desired;  and  the  third  is  a  variable  format  card  for 
the  input  adjacency  matrix.  These  cards  go  directly  before  the  adjacency 
matrix  deck  which  is  followed  by  either  another  set  of  parameter  cards  and 
deck  or,  if  no  more  decks  are  to  follow,  a  blank  card.  CLUSTR  then  computes 


the  ctrong  components  of  the  digraph,  prints  them  out,  and  then  computes 
and  prints  the  weak  components*  The  subroutine  REPUN  is  used  to  calculate 
the  reachability  matrices  needed,  and  the  subroutine  PRINT  is  used  to  print 
the  adjacency  matrix  with  zeros  suppressed.  Output  from  tue  program  includes 
titling  information,  number  of  variables,  the  adjacency  matrix  without  zeros, 
and  the  strong  and  weak  components.  One  member  components  are  not  printed. 

The  cards  required  to  compute  the  components  of  the  example  problem  of 
Chapter  II  using  CLUSTR  are 
00007 

TEST  ADJACENCY  MATRIX 
(10X,7I2) 

ADJ  1  0100000 

ADJ2  1010000 
ADJ  3  0000000 

ADJ  4  0000000 

ADJ5  0110010 
ADJ  6  0000001 

ADJ  7  0000100 

Output  for  this  example  is  found  after  the  program  in  Appendix  B. 

CLUSTR  uses  26,400  words  of  memory  in  the  GE625  with  10,000  words  in 
block  common.  A  drum  or  disc  is  used  for  temporary  storage  during  computa¬ 
tion.  Time  required  is  less  than  .05  hours.  The  program  accepts  up  to  100 


variables 
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CHAPTER  IV:  AH  APPLICATION  OF  CLUSTR 

In  order  to  illustrate  the  use  of  CLUSTR  in  an  actual  study,  it  v.-as 
applied  to  some  data  collocted  by  the  author  in  the  spring  of  1968  on  the 
antecedents  of  temporary  depression.  The  study,  in  the  form  of  a  question¬ 
naire  given  to  students  in  beginning  psychology  classes  who  were  experien¬ 
cing  a  temporary  depression,  was  based  on  several  unpublished  studies  by 
Shelly  (1968)  which  indicated  that  people  soek  an  optimal  arousal  level  and 
are  happiest  with  doing  or  experiencing  things  which  contribute  to  reaching 
that  optimal  level.  For  instance,  to  a  bored  person  arousing  events  such 
as  listening  to  loud  music,  meeting  new  people,  and  engaging  in  a  controver¬ 
sial  discussion  may  be  very  pleasing  to  him}  for  someone  who  has  had  a  par¬ 
ticularly  hectic  day,  listening  to  soft  music  in  his  favorite  easy  chair  may 
be  what  pleases  him  most.  It  was  conjectured  that  students  in  the  midst  of 
a  temporary  depression  might  have  experienced  events  just  beforo  their  de¬ 
pression  which  had  caused  their  arousal  levels  to  be  less  than  ideal.  Ques¬ 
tions  were  asked  about  what  sorts  of  things  had  happened  in  the  twelve  hours 
preceding  the  temporary  depression  and  about  v;hat  sorts  of  things  they  liked 
in  general.  The  questionnaire  may  be  found  in  Appendix  C.  The  actual  data 
is  in  Appendix  D.  There  wore  52  questions  and  53  subjects. 

Similarity  matrices  of  two  types  were  computed  —  a  regular  Pearson  r 
correlation  matrix  and  one  based  on  the  omega-squared  statistic^.  Adjacency 

1.  Population  index  of  the  relative  reduction  in  the  variance  of  Y  given 
the  X  value  for  an  observation: 


(Hays,  1965) 
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mu  trices,  corrected  for  directional  relationships,  were  computed  based  on 
.3,  .4,  and  .5  cutoff  levels  for  the  correlation  matrix  and  on  *09,  .1,  arid 
.2  cutoff  levels  for  the  omega-square  matrix  and  the  variables  v/ere  clus¬ 
tered  using  the  program  CI.USTR. 

The  clusters  for  the  various  levels  of  correlation  ore  shown  in  Figure 
4  and  those  for  the  levels  of  omega-square  in  Figure  5*  Solid  lines  indi¬ 
cate  the  strong  components  and  broken  linos  the  weak  components.  The  actual 
clusters  may  be  found  in  Appendix  B. 

Among  the  patterns  which  can  be  seen  from  the  various  levels  of  clus¬ 
tering  are  two  or  three  large  clusters  with  several  smaller  clusters  both 
within  and  without  the  larger  clusters.  One  of  the  larger  clusters,  on  the 
right  in  Figures  1  and  2  seem3  to  include  mainly  general  attitudes  and  pre¬ 
ferences  reflecting  optimal  arousal  levels.  Among  the  smaller  clusters  in¬ 
cluded  is  one  concerned  with  the  nurabor  and  influence  of  close  friends  (40, 
45)  and  another  cluster  concerned  vdth  liking, dancing  and  controversial  dis¬ 
cussions  (35i35).  Another  interesting  cluster  is  the  weak  component  found 
in  the  correlation  clusters  which  links  controversial  discussions,  change 
in  opinion  about  Vietnam,  and  loudness  of  music  (17,36,42). 

A  second  large  cluster  (lover  loft)  is  concerned  vdth  events  of  the  day 
preceding  the  temporary  depression.  Among  the  variables  which  seem  to  go  to¬ 
gether  in  thic  cluster  are  those  of  excitement,  novel  experiences,  change  in 
relationship  vdth  a  friend,  and  number  of  pleasant  things  that  happened  (7, 
15t5»15»24,26,4).  A  second  group  has  to  do  vdth  boredom,  relaxation,  amount 
of  sleep,  and  number  of  unpleasant  things  (25,27,8,11,14).  The  first  seems 
to  deal  with  increasing  arousal  and  the  presence  or  absence  of  pleasant  ex¬ 
periences.  The  second  group  deals  more  vdth  decreasing  arousal  and  presence 
or  absence  of  unpleasant  experiences.  The  relative  independence  of  pleasant 
and  unpleasant  events  is  an  observation  also  made  by  Shelly  (1967). 
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I 


Figure  4.  Clusters  of  the  adjacency  matrices  based  on  cutoff  levels  of  .3, 
.4,  and  .5  of  the  correlation  matrix.  Solid  lines  represent  strong  compon¬ 
ents,  broken  linos  weak  components.  Innermost  clusters  represent  the  .5  cut¬ 
off  level,  outermost  clusters  represent  the  .3  cutoff  level. 
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Figure  5«  Clusters  of  the  adjacency  matrices  based  on  cutoff  levels  of  .09 
.1,  and  .2  of  the  omega-squared  matrix.  Solid  lines  represent  strong  com¬ 
ponents,  broken  lines  weak  components.  Innermost  clusters  are  those  based 
on  the  .2.  cutoff  level. 
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A  third  large  cluster  links  severity,  length,  and  course  of  the  depres¬ 
sion  (2,^7-52).  The  small  cluster  (16,19)  which  occurs  in  both  groups  of 
clusters  seems  to  indicate  that  trouble  v/ith  a  friend  is  a  separate  reason 
for  a  temporary  depression  independent  of  other  changes  in  arousal  level. 

A  more  thorough  study  of  the  clusters  and  the  similarity  matrices  would 
yield  information  as  v/ell  about  the  direction  (in  the  sense  of  positive  or 
negative  correlations)  and  strength  of  relationships.  It  should  be  remem¬ 
bered  that  the  difference  betv/cen  strong  and  weak  components  is  not  in  the 
strength  of  relationships,  but  in  the  logical  progression  of  the  relation¬ 
ships.  Relative  strength  of  relationship  can  be  discovered  by  comparing 
the  clusters  made  at  different  cutoff  levels,  however.  It  can  be  seen  that 
CLUSTR  can  provide  quite  useful  information  about  the  patterns  of  relation¬ 
ships  v/hich  exist  in  the  system  as  it  is  represented  by  the  data. 
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APPENDIX  A 


Matrix  operations 
Addition 


C  -  A  *  B  *  ♦  b4j  =  c13) 

Cross  product  (element-wise  multiplication) 

C  *  A  X  »  -  <Vij  *  Oy) 

Multiplication  -  number  of  rows  =  number  of  columns  =  n 
C  =  AB  =  (a41a^  +  a49b34  +  ...)  =  (£  aikbkj  »  cij) 


il  lj  i2  2j 


k=l 


Boolean  arithmetic 


1  +  0  =  0 

0  +  0  =  0 

1  +  1  =  0 


1  =  0 


Exactly  the  same  as  regular  arithmetic  except  1  +  1  =  1. 


Adjacency  matrix 


=  1  if  €  digraph 

=  0  if  l/j,  Uj  €  digraph 


Reachability  matrix 
R  =  A  +  A2  +  A5  + 


•••  with  diagonal  elements  =  1. 


Computation  of  reachability  matrix  from  adjacency  matrix. 

Since  there  are  seven  points  in  the  example  problem,  the  greatest 
number  of  lines  which  are  required  to  reach  one  point  from  another  is  six. 
Since  the  (n)th  power  of  an  adjacency  matrix  gives  the  paths  of  length  n  in 
the  corresponding  digraph,  the  greatest  number  of  powers  that  will,  nped  to 
be  taken  in  order  to  find  all  the  paths  in  the  digraph  is  six. 
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The  entries  of  1  in  the  original  adjacency  matrix  A  are  the  paths  of 


length  1, 


1  2  3  4.  5  6  7 


1 

2 

i  3 

A  =  A  =  paths  of  length  1=4 

5 

6 

7 


0  1  0  0  0  0  0 
1  0  1  0  0  0  0 
0  0  0  0  0  0  0 
0  0  0  0  0  0  0 
0  110  0  10 
0  0  0  0  0  0  1  I 
0  0  0  0  1  0  oj 

2 

Multiplying  A  by  itself  we  get  A  ,  the  paths  of  length  2.  The  first 
two  entries  are 

all  =  allall  +  a12a21  +  a13a31  +  al4a4l  +  ai5a5l  +  al6a6l  +  a17a71 

=  0  +  1  +  0  +  0  +  0+  0  +  0=1 


a12  =  allal2  +  a12a22  +  ai3a32  +  a 


l4a42  +  a15a52  +  al6a62  +  a17a?2 


=  0  +  0  +  0  +  0  +  0  +  0  +  0  =  0 


1  2  3  4  5  6  7 


A  =  paths  of  length  2  = 


A  =  paths  of  length  3  = 


paths  of  length  4  = 


0  0 
0  0 


Continuing  to  take  powers,  we  find 

1  2  3  4  5  6  7 


1  2  3  4  5  6  7 
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123^567 

if  0  1  0  0  0  0  o' 

2  1010000 

s  30000000 
t?  *  paths  of  length  5C^  0000000 

5  1110001 

6  0100100 

7  1110010 
u 


Adding  the  powers  using  Boolean  addition 

123^567 

lllllOOOo' 

2  1110000 

0  x  L  2  ^  4  S  30000000 

A  +  A  +  A^  +  A  =  A  +  A  +  Ap  +  A  +  A'  »  4  0000000 

5  1110111 
6  1110111 
7  1110111 

•> 


5 

Since  adding  A  gives  exactly  the  same  matrix  as  before  it  was  added,  it 
is  unnecessary  to  continue  the  process  of  taking  powers  and  adding.  When 

t 

the  diagonal  elements  are  set  to  1,  the  reachability  matrix  is  complete. 

1  2  3  4  5  6  7 

if  1  1  1  0  0  0  o' 

2  1110000 

3  0010000 

Re  4  0001000 

5  1110111 

6  1110111 

7  1110111 
^  m 


APPENDIX  B: 


THE  PROGRAM  CLUSTR  AND  SAMPLE  OUTPUT 


o  o  o  o  o  o  o 


(1) 


HA  INLINE  TO  CALL  CLUSTR 

PARAMETER  CARDS 

1  COL  1-5  NVAR 

2  TITLE 

3  VARIABLE  FORMAT 

DIMENS IOW  IA( 100, 100) ,  KTITLE(lA) ,FMT( 14) 
COMMON  /Q/IA  /R/WVAR,  F TITLE 

1  FORMAT ( 15) 

2  F0RI-AT(13A6,A2) 

8  read  i.nvar 

I^WVAR.EQ.O)  GO  TO  9 
READ  2.KTITLE 
READ  2 , FMT 
DO  3  1=  1 ,  NVAR. 

3  READ  FMT,  (IA(I,J),J  =  l.NVAR) 

CALL  CLUSTR 

GO  TO  8 

9  STGF 
END 


ooo  oooooo 


(2) 


CCLUSTR  COMPUTATION  OF  STRONG  AND  WEAP  COMPONENTS  (  F  DIAGRAPH 
WRITTEN  BY  BARBARA  CROW  JUNE,  1968 


THIS  PROGRAM  COMPUTES  AND  LISTS  THE  MEMBERS  OF  THE  STRONG  AND 
WEAK  COMPONENTS  OF  A  DIGRAPH. 

INPUT  IS  AW  ADJACENCY  MATRIX . 

SUBROUTINE  CLUSTR 

DIMENSION  IA(100,100),  WEAK( 100, 100) ,KTITLE( 14) , 

1 FMT ( 3 ) , FMS (3) , I COMP ( 100 ) , N VZC ( 1 00 ) , IR( 100 , 100 ) , S TRONG ( 100 , 100 ) 
COMMON  /Q/IA  /R/NVAR.KTITLE 
INTEGER  WEAK ,  S  TRONG ,  FMT ,  1-MS  ,  FMR 
EQU I VALENCE  (WEAK .STRONG , IR) 

DATA ( FiiT(  I )  ,  I  =1 , 3  )/6H(  10X,  ,6P1000000,6HI4)  / 

DATA  FMR/6PK ,  0000/ 

DATA  NW/07777/ 

DATA  (NVEC(I),  I  -  1,100)/100*0/ 

DATA((IR(I,J),  J  =  1,100), I  =  i,L00)/L0000*0/ 

PRINTING  TITLE  AND  INPUT  MATRIX 
CALL  PRINT( IA , KTITLE , NVAR , 0 ) 

COMPUTING  STRONG  COMPONENTS 
COMPUTE  REACHABILITY  MATRIX  OF  ORIGINAL  DIGRAPH 
CALL  RERUN ( NVAR , IR ) 

C  MULTIPLY  ELEMENT  WISE  IR  AND  ITS  TRANSPOSE 

DO  50  I  a I, NVAR 

DO  50  J  = I, NVAR 

S TRONG ( I , J)  =  IR( I , J )* IR( J , I ) 

50  CONTINUE 

C  PRINT  STRONG  COMPONENTS 

NCP  =  0 
WRITE (6 ,51) 

51  FORMA T(//20X,3 6 HTHE  STRONG  COMPONENTS  OF  THE  DIGRAPH//) 

C  SET  UP  VECTOR  OF  ELEMENTS  OF  THE  COMPONENT 

DO  55  I  =  1 , NVAR 
I F( NVEC ( I ) . EQ . 1 )  GO  TO  55 
C  INITIALIZE  VECTOR  ICOMP  AND  K 

DO  56  L  =  I, NVAR 

56  ICOMP (L)  =  0 
K  =  0 

C  FORM  VECTORS  OF  ELEMENTS  OF  COMPONENTS 

DO  57  J  =  I, NVAR 
IF(STRONG( I , J) , EQ . 0)  GO  TO  57 
K  =  i:  +  i 

ICCMP(K)  =  J 
NVEC (JO  =  1 

57  cu/rii.ui; 

C  ELIMINATE  ONE  MEMBER  COMPONENTS 

IF(K.EQ.l)  GO  TO  55 
C  SET  UP  PART  OF  FORMAT 

NCP  =  NCP  +  1 
FMS(l)  =  FMT(l) 

FMS  (  3  )  =  FMT(3) 


on  o  o  o  o  o 


(3) 


C  FINISH  FORMAT 

IF(K.LT.IO)  GO  TO  58 
C  TWO  DIGIT  FIELD  i-RCBLEM 
NQ  =  NUMB (K, 6) 

NG  =  AND ( NW , fk) ) 

FMS ( 2 )  =  FMR  =  NG 
GO  TO  59 

58  FMS(2)  =  FMR  +  I- 

C  PRINT  COMPONENT 

59  WRITE(6 ,42)  HCP 

42  FORMAT (1R0.9X, 2 9H VARIABLES  IN  STRONG  COMPONENT, 13) 

C  SET  UP  LINE  TO  PRINT 

IS  =  MIN0(I  , 30) 

DO  13  J1  -  1,1  ,  IS 
J2  =  MINC( Jl+IS-1 .P  ) 

WRITE (6 , FMS )  (ICCMP(J).J  -  J1.J2) 

13  CONTINUE 

55  CONTINUE 

COMPUTING  WEAK  COMPONENTS 
I N I T I AL I2E  N VEC , IS UM , W EAK 
DO  30  I  =  l.NVAR 
30  WEAK ( I ,  J )  -  0 
FMS  (2)  =0 

ADD  IA  AND  ITS  TRANSPOSE  TO  FORM  SYMMETRIC IZED  DIGRAPH 
DO  10  I  =  l.NVAR 
DO  10  J  =  l.NVAR 
IA(I.J)  =  IA( I , J )  +  IA( J , I ) 
if( ia( I , J) , GT. 1)  IA( I , J)  =  1 
10  CONTINUE 

COMPUTE  REACHABILITY  MATRIX  OF  SYMETRICIZED  DIGRAPH 
CALL  RE  t U  N  (  in' VAR ,  WEAK  ) 

PRINT  OUT  WEAK  COMPONENTS 
NCP  =  0 
WRITE (6 , 16) 

16  FORMAT ( //2.0X , 34HTHE  WEAK  COMPONENTS  OF  THE  DIGRAPH//) 

SET  UP  VECTOR  OF  MEMBERS  OF  THE  COMPONENT 
ENTRIES  OF  I  IN  ROW  GIVE  MEMBERS  OF  THE  SAME  COMPONENT 
DO  22  I  =  l.NVAR 
IF(NVEC(I).EQ.l)  GO  TO  22 
C  INITIALIZE  VECTOR  ICoMP  AND  K 

DO  19  L  =  l.NVAR 

19  ICOMP (L)  =  0 
K  =  0 

C  FORM  VECTORS  OF  MEMBERS  OF  EACH  COMPONENT 

DO  20  J  =  l.NVAR 
IF(WEAL(I,J) .EQ.l)  GO  TO  20 
C  K  IS  THE  NUMBER  OF  ELEMENTS  IN  A  COMPONENT 

K  =  K  +  1 
ICOMP (K)  =  J 
NVEC(J)  =  1 

20  CONTINUE 

C  ELIMINATE  ONE  MEMBER  COMPONENTS 

IF(K.EQ.l)  GO  TO  22 
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C  FINISH  FORMAT 

wet  =  NCr  +  1 
IF(k.LT.LO)  GO  TO  25 

C  ELIMINATE  PROBLEMS  OF  TWO  DIGIT  FIELD  WIDTH 

NQ  =  NUMB (k, 6) 

NG  =  AND ( WW , NO ) 

FMS(2)  *  PMT(2)  +  NG 
GO  TC  26 

25  FMS(2)  =  FMT ( 2 )  +  k 

C  PRINT  COMPONENT 

26  WRITE (6, 44)  NCP 

44  FORMAT( 1HO ,  9k ,  2 7HVARIABLES  IN  WEAK  COMPONENT, 13) 

C  SET  UP  LINE  TO  PRINT 

IS  =  MINo(K, 30) 

DO  12  J1  =  1 ,  K ,  IS 
J2  =  MINO(  Jl+IS-1  ,I<) 

WRITE (6 , FMS )  (ICOMP(J).J  =  J1.J2) 

12  CONTINUE 

22  CONTINUE 
RETURN 
END 


o  o  o  o  o  o 
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RE  PUN  CALCULATION  OF  AEACi 'ABILITY  OK  N-kEaCHABILITY  MATRIX 

TiilS  PROGRAM  CALCULATES  A  REAOHA.'.  ILITY  MATRIX  FRuM  AW  ADJACENCY 
MATRIX.  THE  VALUE  OF  N  FOR  AW  W-REACWAL ILITY  MATRIX  MAY  BE 
SPECIFIED.  REACHABILITY  MATRIX  HAY  BE  PRINTED  CR  PUNCHED . 

SUBROUTINE  REPUW(NIT , IJ) 

DIMENSION  IA(I00, 100) , IJ( 100, 100) ,  IQ( 100, 100) ,kTITLE(l4) 

COMMON  /A/IA  /R/NVAR, KTITLE 
REWIND  I 
DO  3  I  =  1  ,NVAR 
DC  3  J  =  l.NVAR 
C  ABSOLUTE  VALUE  OF  IA 
IA( I , J)  =  IADS ( IA( I , J) ) 

C  INITIALISE  IJ  AND  IQ  AND  DRUM 
IQ(I.J)  =0  . 

3  IJ(I,J)  =  IA(I,J) 

WRITE  (1)  ((IA(I,J),J  =  l.NVAR), I  =  I.NVAR) 

REWIND  1 

C  L  IS  THE  NUMBER  OF  BOOLEAN  -POWERS  TAREN 
L  •*  0 
NT  =  NIT-1 
DO  9  R  =  1  (NT 
L  =  L  +  1 

C  COMPUTE  BOOLEAN  POWERS  OF  IA 
DO  5  I  =  l.NVAR 
DO  5  J  =  l.NVAR 
DO  4  M  =  l.NVAR 

4  IQ(I.J)  =  IQ(I,J)  +  IA(I,M)*IJ(M,J) 

5  IF(IQ(I,J).GT.l)  IQ(I,J)  =  I 

C  READ  IN  REACHABILITY  MATRIX  FROM  DRUM 

DC  1  I  =  l.NVAR 
DC  I  J  =  I.NVAR 

1  IJ(I,J)  =  0 

READ  (I)  ((IJ(I.J).J  =  I.NVAR).  I  =  l.NVAR) 

REWIND  I 

C  CHECK  FOR  CONVERGENCE 

DO  6  I  »  I.NVAR 
DO  6  J  =  I.NVAR 

IF( IQ( I , J) . EQ . L .AND. IJ( I, J) .EQ.O)  GO  TO  7 

6  CONTINUE 
GO  TO  12 

C  SUM  POWERS  OF  IA 

7  DO  2  I  =  I.NVAR 
DO  2  J  =  I.NVAR 

IJ(I.J)  =  IJ(I,J)  +  IQ(I,J) 

2  IF( I J( I , J ) . GT . I )  IJ(I.J)  =  1 

C  CHECK  FOR  COMPLETED  N-REACMB  ILITY  MATRIX 
IF(K.EQ.NT)  GO  TO  12 
C  STORE  MATRIX  ON  DRUM 

WRITE  (I)  ((IJ(I,J),J  =  I.NVAR),  I  =  l.NVAR) 

REWIND  I 
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C  REINITIALISE  IJ  AIM'D  IQ 
Do  S  I  =  I ,  I:  VAR 
DO  8  J  =  l.NVAK 
IJ(I,J)  =  0 
IJ(I,J)  =  IQ(I.J) 

8  Iq( I , J)  =  0 

9  CONTINUE 

C  SET  DIAGONALS  EQUAL  TO  1 

12  DC  13  I  =  l.NVAll 

13  IJ(I,I)  =  1 

RETURN 

END 


I 
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C PRINT 

C 

C  This  SUBROUTINE  SUPRESSES  NUMBER  LABELED  hi.Lk  AND  PRINTS  GUT 

C  MATRIX  WITH  BLANKS  IN  THOSE  SPACES.  II  ONLY 
SUBROUTINE  pRIWT(IAJ , KT ITLE , NVAR , NBLK ) 

DIMENS ION  IAJ (100,1), KTITLE( 14 ) , FORM( 105 ) 

INTEGER  ALEFT .ARIGHT , AS PEC , BLANKS , X3SPEC , AS PECO, ^oRK 
DATA  ALEFT , ARIGHT , ASPEC , ISPECM , BLANKS ,X3SPEC , ISPEC , IS  PECO  .ASPECo/ 
16H(  ,6H)  , 6HA1 ,  ,  6HI1,  ,6H  ,6h3X,  ,biil3, 

26I1II  ,  6HAI  / 

C  SET  UP  OPENING  AND  CLGSING  PARENTHESES  FOR  OUTPUT  FORMAT 
FORM ( 1 )  =  ALEFT 
FORM (NVAR  +  5)  =  ARIGHT 
(2)  =  X3SPEC 
FORPi  ( 3  )  =  ISPEC 
FORM(4)  =  X3SPEC 

7  FORMAT ( IHI ,  25X,  I3A6 ,  A2) 

V/RITE(6 , 7 )  (KTITLE(I),  1=1,14) 

8  FORMAT  ( IHO , 2 5X , 2 1HNUMBER  OF  VARIABLES  =  13) 

WRITE (f , 8)NVAR 

5  FORMAT ( 1H0 , 8X , IHI , 8X , 2HI0 , 8X , 2H20 , 8X , 2h30 , 8X , 2h40 , 8X , 2h60 , 8X , 
12H60 , 8X , 2H70 , 8X , 2H80 , 8X , 2h90 , 8X , 3HI00// ) 

WRITE(6,5) 

DO  30  I  =  I, NVAR 

MVAR  =  NVAR- I 

DO  31  J  =  I, MVAR 

IF( IAJ ( I ,  J  )  .  EQ  . NBLK)  GO  TO  32 

F0«M(J+4)  =  ISPECM 

GO  TO  31 

32  F0RM(J+4)  ASPEC 

IAJ(I.J)  =  BLANKS 

31  CONTINUE 

IF  ( IAJ  (I,  NVAR).  EQ.  NBLK)  GO  TO  29 
FCRM(NVAR+4)  =  ISPECO 
GO  TO  28 

29  FDRK(NVAR+4)  =  ASPECO 
IAJ (I, NVAR)  =  BLANKS 

C  WRITE  OUT  ARRAY  UNDER  CONTROL  OF  COMPUTED  FORMAT 

28  WRITE(6 , FORM)  I,  (IAJ(I,J),J  =  l.NVAR) 

DO  11  J  =  l.NVAR 

11  IF(IAJ( I, J).EQ. BLANKS)  IAJ(I,J)  =  0 

30  CONTINUE 
RETURN 
END 
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TEST  ADJACENCY  MATRIX 


NUMBER  OF  VARIABLES  =  7 


10  20  30  40  50 
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THE  STRONG  COMPONENTS  OF  THE  DIGRAPH 


VARIABLES  IN  STRONG  COMPONENT  I 
1  2 

VARIABLES  IN  STRONG  COMPONENT  2 
5  6  7 

THE  WEAK  COMPONENTS  OF  THE  DIGRAPH 

VARIABLES  IN  WEAF  COMPONENT  I 
1  2  3  5  6  7 
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APPENDIX  C: 


QUESTIONNAIRE 


ANTECEDENTS  OF  TEMPORARY  DEPRESSION 


Instructions 


Everyone  occasionally  goes  through  temporary  depressions.  This  study 
is  an  investigation  of  some  of  the  factors  that  may  be  associated  with  de¬ 
veloping  a  temporary  depression.  Try  to  be  as  careful  as  possible  in  fil¬ 
ling  out  the  attached  questionnaire. 

You  are  to  keep  the  attached  questionnaire  until,  as  happens  to  almost 
everyone,  you  go  through  a  temporary  depression.  There  is  no  need  to  re¬ 
turn  this  questionnaire  quickly;  if  you  go  through  the  rest  of  the  semester 
without  a  temporary  depression  you  will  still  receive  one  credit.  If  you 
do  experience  a  temporary  depression,  as  you  are  likely  to,  then  fill  out 
the  attached  questionnaire  sometime  during  the  depression. 

After  you  have  completed  the  questionnaire  bring  it  to  the  Social  Psy¬ 
chology  Secretary  in  Room  60?  PVaser.  The  secretary  will  give  you  one  credit 
for  your  participation. 

When  the  results  have  been  analysed  (late  this  summer  or  early  fall) 
you  may  pick  up  a  copy  of  the  results  in  the  Social  Psychology  Office. 

Thank  you  very  much  for  your  cooperation. 


ANTECEDENTS  OF  TEMPORARY  DEPRESSION  QUESTIONNAIRE 


1.  Sex 

1*  male 

2.  female 

2.  How  long  has  it  been  since  this  temporary  depression  began?  (Use  the 
time  you  think  it  began,  not  when  you  first  noticed  it.) 

1.  less  than  two  hours 

2.  two  to  four  hours 

3.  four  to  six  hours 

4.  one  day 

5.  two  days 

6.  three  days 

7.  four  days  to  a  week 

8.  more  than  a  week 

3.  What  time  of  day  did  this  temporary  depression  begin? 

1.  morning 

2.  afternoon 
3»  evening 

4.  late  at  night 

Most  of  the  following  questions  refer  to  the  twelve  waking  hours  pre¬ 
ceding  the  temporary  depression: 


-39- 


4.  How  happy  wore  you  in  the  twelve  waking  hours  preceding  this  temporary 
depression? 

1.  not  as  happy  as  usual 

2.  about  as  happy  as  usual 

» 

3.  a  little  happier  than  usual 

4.  much  happier  than  usual 

5.  Did  you  do  anything  in  the  twelve  waking  hours  preceding  the  depression 
that  you  usually  don't  do? 

1.  no,  nothing  unusual 

2.  something  somewhat  unusual 

3.  something  quite  unusual 

6.  Did  you  meet  anyone  you  hadn't  met  before  in  these  twelve  preceding  hours? 

1.  yes 

2.  no 

7 •  How  exciting  were  the  twelve  waking  hours  preceding  the  temporary  depres- 


sion? 

1  2 

3 

4  5 

6 

7 

8  9 

extremely 

dull 

extremely 

exciting 

8.  How  relaxing 

were 

they? 

1  2 

3 

4  5 

6 

7 

8  9 

extremely 

relaxing 

extremely 

tense 

9«  How  often  did  you  engage  in  arguments  (friendly  or  otherwise)  in  the 
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twelve  waking  hours  preceding  the  depression? 

1.  not  at  all 

2.  once 

3.  twice 

4.  three  times 

5»  more  than  three 

10.  How  much  did  you  have  to  do  that  day? 

1.  very  little 

2.  not  too  much 

3.  quite  a  bit 

4.  a  great  deal  to  do 

11.  How  much  sleep  did  you  get  the  night  preceding  the  temporary  depression? 

1.  much  less  than  usual 

2.  a  little  less  than  usual 

3.  about  the  same  as  usual 

4.  a  little  more  than  usual 
5«  a  lot  more  than  usual 

12.  How  many  times  did  you  feel  really  relaxed  in  the  twelve  waking  hours 
preceding  the  depression? 

1.  not  at  all 

2.  once 
3»  twice 

4.  three  times 


5*  more  than  three  time3 
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13.  How  many  times  did  you  feel  very  excited? 

1.  not  at  all 

2.  once 

3.  twice 

4.  three  times 

5.  more  than  three  times 

14.  How  many  times  did  you  feel  really  bored? 

1.  not  at  all 
2*  once 
3«  twice 

4.  three  times 

5.  more  than  three  times 

15.  Did  your  relationship  to  one  of  your  friends  change  for  the  better  in 
the  tv/elve  waking  hours  preceding  the  temporary  depression? 

1.  no 

2.  yes,  somewhat 

3.  yes,  our  relationship  changed  a  great  deal  for  the  better 

16.  Did  your  relationship  with  a  friend  change  for  the  worse? 

1.  no 

2.  yes,  somewhat 

3.  yes,  a  great  deal  for  the  worse 

1?«  How  many  times  did  you  engage  in  controversial  discussions  in  the  twelve 
waking  hours  preceding  the  depression? 


!•  not  at  all 
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2.  once 

3.  twice 

4.  three  times 

3*  more  than  three  times 

18.  How  many  different  places  did  you  go  in  the  twelve  hours  preceding  the 
depression? 

1.  one  to  two 

2.  three  to  four 
3»  five  to  ten 

4.  ten  to  twenty 

5.  more  than  twenty 

19.  During  the  preceding  twelve  hours  did  a  person  of  the  opposite  sex 
cause  you  much  trouble? 

1.  none 

2.  a  little 

3.  quite  a  bit 

4.  a  great  deal 

20.  Did  you  do  better  than  expected  on  something  you  did  in  the  twelve  hours 
preceding  the  depression  (or  receive  the  results  of  something  you  had  done 
earlier)? 

0.  doesn't  apply 

1.  no,  no  better  than  expected 

2.  somewhat  better  than  expected 

3.  much  better  than  expected 
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21.  Did  you  do  worse  than  expected  on  something? 

0.  doesn't  apply 

1.  no,  no  worse  than  expected 

2.  somewhat  worse  than  expected 
3#  much  worse  than  expected 

22.  How  many  people  were  you  with  during  most  of  the  twelve  waking  hours 
preceding  the  temporary  depression? 

1.  none,  I  was  alone  most  of  the  time 

2.  one  or  two  other  people 

3#  three  to  five  other  people 

4.  five  to  ten 

5.  ten  to  twenty 

6.  more  than  twenty 

23#  The  number  of  people  you  were  with  was 

1.  less  than  usual 

2.  about  the  same  as  usual 
3#  more  than  usual 

24.  How  many  pleasant  things  happened  to  you  in  the  twelve  waking  hours 
preceding  the  depression? 

1.  none 

2.  a  few 
3#  some 

4.  quite  a  few 
5#  many 
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25 •  How  many  unpleasant  things  happened? 

1.  none 

2.  few 

3.  6ome 

4.  quite  a  few 
5«  many 

26.  Was  the  number  of  pleasant  things  greater  or  smaller  than  usual? 

1.  less  than  usual 

2.  aoout  same  as  usual 
3#  more  than  usual 

27.  Was  the  number  of  unpleasant  things  greater  or  smaller  than  usual? 

1.  less  than  usual 

2.  about  same  as  usual 

3.  more  than  usual 

28.  When  the  temporary  depression  began  to  come  on  were  you 

1.  in  a  very  familiar  place 

2.  in  a  somewhat  familiar  place 

3.  in  a  place  you  had  never  been  before 

29.  How  many  associations  does  this  place  arouse  for  you? 

123456789 

very  a  great  many 

few  associations 

30.  Were  there  any  specific  events  which  you  feel  precipitated  the  tempor¬ 
ary  depression? 


-45- 


1.  don't  know 

2.  nothing  in  particular 

3.  a  series  of  events 

4.  a  single  event 

The  following  are  very  general  questions  about  life  of  a  college  student 
at  KU. 

31.  To  what  extent  are  you  "going  with"  someone  of  the  opposite  sex? 

1.  not  at  all 

2.  dating  one  person  more  than  any  other 

3.  dating  only  one  person 

4.  engaged 

5.  married 

32.  How  often  do  you  go  places  to  meet  new  people? 

1.  several  times  a  day 

2.  several  times  a  week 

3.  once  a  week 

4.  once  every  two  weeks 

5.  less  than  every  tv/o  weeks 

33*  To  what  extent  do  you  enjoy  discussing  controversial  subjects? 

1.  extremely  well 

2.  quite  well 

3.  all  right  once  in  a  while 

4.  not  very  well 

5.  not  a'  all 


13 


1 
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34.  How  happy  is  your  life  at  KU  in  general? 

123456789 
miserable  extremely 

happy 

35»  Do  you  enjoy  dancing? 

1.  yes 

2.  sometimes 
3*  no 

36.  Do  you  prefer  loud  or  soft  music? 

1.  loud 

2.  in  between 

3.  soft 

37.  How  much  do  unfriendly  arguments  upset  you? 

1.  not  at  all 

2.  very  little 

3.  not  too  much 

4.  quite  a  bit 

5.  a  great  deal 

38.  To  what  extent  do  you  enjoy  working  on  problems  you  are  unlikely  to  be 
able  to  solve? 

1.  a  great  deal 

2.  once  in  a  while,  or  until  I  decide  it  can't  be  solved  easily 

3.  I'd  rather  stick  to  problems  I  can  solve 

39.  Do  you  dislike  movies  with  tragic  endings? 


-4?- 


1.  yes 

2.  no 

40.  How  many  close  friends  do  you  have? 

1.  none 

2.  one 

3.  two 

4.  three  to  five 

5.  five  to  ten 

6.  more  than  ten 

41.  Has  your  opinion  about  the  draft  changed  in  the  last  few  months? 

1.  yes 

2.  no 

42.  About  the  Vietnam  war? 

1.  yes 

2.  no 

43.  Do  you  prefer  parties  where  there  are 

1.  just  two  people 

2.  a  few  people 

3.  10  to  20  people 

4.  more  than  30  people 

44.  How  far  ahead  do  you  like  to  make  dates? 


1.  on  the  spur  of  the  moment 

2.  one  day 
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3.  two  days 

4.  a  week 

5.  two  weeks 

6.  more  than  two  weeks 

43*  When  you  make  decisions  do  you  think  about  what  close  friends  might  do? 

1.  yes 

2.  no 

46.  How  much  do  you  enjoy  a  good  meal? 

1.  a  great  deal,  one  of  my  favorite  things  to  do 

2.  quite  a  bit 

3.  okay 

4.  just  something  that  has  to  be  done  before  going  on  to  other  things 

To  be  answered  as  soon  as  possible  after  the  temporary  depression  is  gone. 

4?.  How  long  did  the  temporary  depression  last  all  together? 

1.  less  than  two  hours 

2.  two  -  four  hours 

3.  four  -  six  hours 

4.  a  day 

5.  two  days 

6.  three  days 

7.  four  days  -  week 

8.  two  weeks 


9.  more  than  two  weeks 
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48.  Compared  with  other  temporary  depressions  you  have  experienced ,  how 
severe  was  this  one? 

1.  less  severe  than  most 

2.  about  same 

3«  more  severe  than  most 


49.  -  52.  To  help  describe  the  course  of  the  temporary  depression,  please 
use  the  chart  below.  Start  at. the  left  side  of  the  chart  and  draw  a  contin¬ 
uous  curve  showing  how  you  felt  during  the  temporary  depression, 
happier  9  ' 
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APPENDIX  D: 


DATA,  ADJACENCY  IUTRICES  AND  CLUSTERS 
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